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Title: In TT ivo Recombination 

Field of the invention 

The present invention relates to a method for in vivo 
5 recombination of homologous DNA sequences. The method is a 

forced artificial evolution resulting in a DNA sequence encoding 
a polypeptide having an advantageous property. 

Back ground of the invention 

10 Homologous recombination between DNA sequences placed on 

the same plasmid is well known to the skilled artisan (Weber and 
Weissmann, Nucl. Acid. Res. (1983) .11, 5661-5669). EP 252666 Bl 
(Novo Nordisk A/S) , W095/22625 Al (Affymax Technologies N.V.) 
and WO9101087 discloses in vivo recombination of genes placed on 

15 the same plasmid. 

EP 449923 Bl (Setratech) discloses a process of inter- 
genic recombination in vivo of partially homologous DNA se- 
quences. The recombination takes place in cells of which the en- 
zymatic mismatch repair system is defective. 

20 J. Biotechnol. (1991) 19, 221-240 discloses a process 

for inactivating a gene in the genome of B. Amyloliquefaciens. 
The process involves integration and excision of a pE194 mutant. 

In Chemical Abstracts 118:161925 (1993) the minimal se- 
quence length of homology necessary for recombination is deter- 

25 mined. The method of determination involves integration of a 
pE194 derivative carrying a gluconase gene. 

In J. Bacterid. (1932) 152 , 524-526 a plasmid pBD9 , 
which comprises the two plasmids from Staphylococcus aureus, 
pE194 and pUBHO, is disclosed. Other plasmids based on pE194 or 

30 pUBHO are disclosed in Molecular and General Genetics (1988) , 
213, 465-70, Molecular and General Genetics (1984)', 195, 374-7, 
Plasmid (1981), 6, 67-77 and Genetika, (1986) 22, 2750-7. 

In Yichuan Xuebao (1993), 20, 272-8 (see Chemical Ab- 
stracts 120:1670 (1994)), Chen et . Al . disclose the plasmid 

35 pNV7102, a pE194 derivative, in which the thermostable a-amylase 
gene from Bacillus licheniformis is inserted. pNW102 was trans- 
formed into the Bacillus subtilis strain BF 7658, followed by 
incubation at non-permissive temperature, allowing homologous 
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recombination between the a-amylase genes of P NW102 and BF 7658. 
The a-amylase produced of the recombinant strains shows the same 
characteristics as the a-amylase from Bacillus licheniformis . 

However, in the related art there is no indication of 
5 that DNA sequences can be recombined in vivo by repeating inte- 
gration and cross-out in order to obtain a gene encoding a 
polypeptide with improved characteristics, such as increased 
stability, improved specificity or higher activity. 

Accordingly, a problem of the invention is to provide an 
io improved in vivo process of generating new DNA sequences. 

Rripf Description of t h* invention 

The present invention is directed to a in vivo process 
of generating new DNA sequences. It has turned out that new DNA 
is sequences with improved properties can be generated in a fast 
and efficient way from at least two homologous DNA sequences by 
a process involving in vivo exchange of DNA between at least two 
vectors, each containing a homologous DNA sequence and an origin 
of replication. 

20 The process of the invention is characterized by, that 

said exchange of said homologous sequences is done by repeating 
in vivo at least once, said integration and excision of one vec- 
tor into the other. 

Accordingly, in a first aspect the invention relates to 

25 a process for in vivo recombination of homologous DNA sequences, 
comprising the following steps 

(a) incubating a cell containing at least two DNA struc- 
tures, each comprising a DNA sequence and an origin of 
replication, under conditions which favour integration 

30 0 f on e of the DNA-structures into one of the others, 

thereby forming a hybrid DNA structure; 

(b) incubating the cell under conditions which favour cross- 
ing out from said hybrid DNA structure, thereby forming 
novel DNA structures, each comprising a recombined DNA 

35 sequence and an origin of replication; 

( C ) repeating steps a)-b) at least once. 
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One of the advantages of the in vivo recombination 
process of the invention is by repeating the exchange in vivo 
between the DNA sequences it is possible to obtain numerous dif- 
5 ferent recombination patterns between said DNA sequences. 

This is illustrated in figure 1 herein, which shows the 
result of in vivo recombination between the two DNA sequences 
Savinase and Savisyn performed as described in example 1. 

In figure 1 Savinase/Savisyn recombined clones such as 
10 clone no. 3, 8 12 are derived from more than one in vivo recom- 
bination event between the two DNA sequences. 

Brief description of the drawing 

Figure 1 illustrates the recombination of two DNA se- 
15 quences, placed on the plasnids pTR, temperature resistant since 
origin of replication is functional at 50°C / and pTS , tempera- 
ture sensitive since origin of replication not functional over 
45°C Under non-pernissive conditions (i.e. at 50°C) , pTS inte- 
grates into pTR, forming a hybrid plasmid. 
20 Figure 2 illustrates the cross out from the hybrid plas- 

mid, forming two new DNA sequences, each placed on a plasmid. 
Repeated crossing in and out may be done by e.g. temperature cy- 
cling or simply by keeping temperature pressure and selecting 
for resistance of pTS (i.e. Errr") . 
25 Figure 3 illustrates the plasmids which are obtained af- 

ter repeated integration and crossing out events. 

',v Further, figures 1 and 2 illustrate a DNA structure sys- 
tem suitable for in vivo measurement of hybrid structure forma- 
tion, and suitable for performing a forced cross out of the DNA 
30 hybrid structure. 

Fig. 1: DNA structure pTS : 

Suitable active promoter directing ex- 
pression of active protein 
Erm R : antibiotic resistance gene 
35 DNA structure pTR comprises: 

Not any active promoter to direct tran- 
scription of GFP 
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GFP: Green Fluorescent Protein 
The Erm R antibiotic resistance gene in pTS is active due 

to the active promoter. 

The GFP gene in pTR is inactive (i.e. the gene is non- 
5 transcribed) due to there is not an active promoter to drive ex- 

9 

pression of the gene. 

Fig. 2: After formation of the hybrid structure accord- 
ing to the invention GFP is now active (driven by the promoter) . 
This makes it possible to in vivo measure the hybrid structure 
10 formation (see below for further details) . 

in contrary the Erm R antibiotic resistance gene is now 
inactive (i.e. the gene is non-transcribed) due to there is not 
an active promoter to drive expression of the gene. Forced cross 
out of the DNA hybrid structure is performed by incubating the 
is cells under Erm R antibiotic pressure, thereby forcing a cross 
out of the DNA hybrid structure, since the cell is only able to 
express the Erm R antibiotic resistance gene after the crossing 
out of the DNA structure has taken place. 

Figures 4 and 5. Illustrate the two vectors pSX120 (fig. 
20 4) and P MB430 (fig. 5) used as described in example 1 to in vivo 
reco-bine the two DNA sequences Savinase and Savisyn. 

PSX120 comprises i) an open reading frame encoding Sav- 
inase ii) an active promoter directing transcription of Sav- 
inase', and iii) a gen. conferring resistance to chloramphenicol 

^ } ' pMB430 conorises i) a Savizyn DNA fragment, ii) a termi- 
nator (resulting in no transcription through the regions follow- 
ing this) , iii) a gene conferring resistance to erythromycin 
(£rj /) , and iv) an open reading frame encoding Green Fluorescent 

30 Protein (GFP) . 

Fig. 6: Illustrates the results of Savinase/Savisyn an 

vivo recombination performed as described in example 1. 

The schematic representation shows 12 of the shuf- 
f led/recombined clones. 
35 ' Horizontally are given the positions of the 

restriction sites unique for Savizyn. 

The letter "S" denotes that the sequence at this position 
is identical to the Savinase sequence and the letter »Z« denotes 
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that the sequence stems from Savizyn.. All of the sequenced 
clones are different in their pattern of recombination. 

pef initions 

5 As used herein, the following terms have the following 

meanings: 

The term M DNA sequence 11 includes any DNA sequence which 
is desirable to modify according to the invention e.g. a DNA se- 
quence encoding, a polypeptide, for example an enzyme, pharma- 

10 ceutically active polypeptides, e.g. insulin growth hormone hu- 
man hormones or growth regulators, and sequences such as promot- 
ers, transcription or translation .regulators and other intracel- 
lular polynucleotides and polypeptides. 

The term "homologous" means the DNA sequences have some 

15 stretches of identical nucleotides, allowing recombination in 
vivo. Said stretches within the DNA sequences are at least 15 
base pair long, more preferably a: least 25 base pair long, even 
more preferably at least 50 base long", and most preferably at 
least 150 base pair or more. 

20 The term "homologous' 1 comprises stretches of DNA 

sequences that are from 60% identical to DNA sequences that 
differs in only one nucleotide. It is preferred that the 
stretches of DNA sequences are more than 70% identical, more 
preferably at least 30%, especially more than 90% identical, and 

25 more preferred that the stretches of DNA sequences are 95% 
identical, most preferred at least 97%. 

Within the stretches of the DNA sequences of interest,, 
the DNA sequence homology referred to above is determined as the 
degree of identity between two sequences indicating a derivation 

30 of the first sequence from the second. The homology may suitably 
be determined by means of computer programs known in the art, 
such as GAP provided in the GCG program package (Program Manual 
for the Wisconsin Package, Version 8, August 1994, Genetics 
Computer Group, 575 Science Drive, Madison, Wisconsin, USA 

35 53711) (Needleman, S.B. and Wunsch, CD., (1970), Journal of Mo- 
lecular Biology, 48, 443-453). Using GAP with the following set- 
tings for DNA sequence comparison: GAP creation penalty of 5.0 
and GAP extension penalty of 0.3, the homologous DNA sequences 
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of interest exhibits a degree of identity preferably as de- 
scribed above. 

The term "Recombined DNA sequence" means the DNA se- 
quence which is a result of a recombination event between at 
5 least two DNA sequences. The recombination event may be a single 
recombination event or may be first a recombination event be- 
tween at least two DNA sequences followed by a cross out at a 
different site than the first recombination event. 

The term "DNA structure" means a DNA molecule, e.g. a 
10 vector, a plasmid, a chromosome (e.g. a Bacillus subtilis chro- 
mosome) , a virus, a phage, a transposon or a genome. 

The term "vector" means any linear or circular DNA mo- 
lecule that is able to contain a DNA sequence to be recombined 
and an origin of replication. The term includes any vector known 
15 to the skilled artisan e.g. a plasmid or a phage. The DNA se- 
quence is optionally-but not necessarily- under control of a 
promoter. 

The term "a cell" used herein in connection with a in 
vivo recombination process of the invention is intended to cover 
20 both "a cell" and "a cell population". 

The term "transferring" r.eans that the DNA structure is 
introduced into a cell. The DNA structure may be introduced using 
techniques known in the art for direct introduction of DNA, e.g. 
by use of electroporation, transformation of competent cells, pro- 
25 toplast transformation, transection, transduction, conjugation or 
ballistic transformation. 

The term "incubating" (or cultivating, growing) means 
that the cell is treated in such a way that the normal cell 
functions are active, especially the functions necessary for re- 
30 combination. The cell can be incubated on a solid substrate 
(agar petri disc) or the cell can be incubated under submerse 
conditions. It is preferred that the cell is incubated in a test 
tube, e.g. an eppendorph test tube. In the case of a tempera- 
ture-sensitive origin of replication, the test tube is conven- 
35 iently placed in an automatic termocycler. Further the cell may 
be incubated in standard shake-flaks. 

The term "origin of replication" (ori) means a region 
where replication of a DNA structure is initiated. The term 
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functional in connection with origin of replication means that 
replication can be initiated 'from the origin* The term non- 
functional means that no replication is initiated under the 
given conditions, or that the initiation of replication is redu- 
5 ced, e.g. to a level insufficient for survival of a cell con- 
taining a DNA structure with a marker gene and the ori, under 
appropriate selection pressure. 

The term "temperature sensitive" in connection with ori- 
gin of replication or vector means the vector is unable to rep- 

10 licate at e.g. increased temperatures, i.e. non-permissive con- 
ditions, which yet permit growth of the parent cell. 

The term "temperature resistant" in connection with 
origin of replication or vector means the vector is able to 
replicate at increased temperatures. 

as The term "DMA library" is a library comprising numerous 

different DNA sequences. A DNA library may be made according to 
standard procedures known in the a::, such as e.g. by in vitro 
DNA shuf f ling/DNA recombination (*.0 95/17413, WO 95/22625) 
and/or Error-prone PCR and cassette mutagenesis. 

20 A DNA library herein may comprise as„low as 2-20 

different sequences, as e.g. a library where only one amino acid 
is varied. Further a DNA library nay comprise numerous different 
DNA sequences; e.g. up to 10 10 different sequences. 

2 5 Detailed Description of the Invention 

The present invention comprises a' process for in vivo 
recombination of homologous DNA sequences, comprising the steps 

(a) incubating a cell containing at least two DNA struc- 
tures, each comprising a DNA sequence and an origin of 

30 replication, under conditions where i) at least one ori- 

* gin of replication is non-functional and ii) the cell is 
under a selective pressure which only allows growth of 
the cell if it contains the DNA structure which is hav- 
ing the non-functional origin (i.e. favouring an inte- 

35 gration of one of the DNA-structures into one of the 

others, thereby forming a hybrid DNA structure); 

(b) changing of said conditions to conditions, where fewer 
origins of replication are non-functional than in step 



WO 98/28416 



8 



PCT/DK97/00567 



a) (i.e. favouring a crossing out from said hybrid DNA 
structure) ; 

(c) repeating steps a)-b) at least once. 

5 The recombined DNA sequence will encode new polypeptides 

having new properties. It is then possible to incorporate a se- 
lection or screening system in the process of the invention in 
order to select and/or screen for desired properties encoded by 
of or more of the recombined DNA sequence(s). 
10 The number of repetition (s) in step (c) above nay be one 

repetition or it may be more such as 5-10 repetitions or 50-100 
or even higher numbers of repetitions. 

It is preferred that integration is forced (or made fa- 
vorable for survival of the DNA structure) by incubation of the 
15 cell under such conditions whers the origin of replication of a 
DNA structure is non-functional, and that cross out is forced by 
incubation of the cell under su.:r. conditions where the origin of 
replication of a DNA structure again is functional. 

In another preferred embodiment the routine depicted 
20 above in the steps (a) to (c) it is preferred that the cells are 
kept under constant conditions {e.g. constant temperature and 
selection pressure) where spontaneous cross-in and cross-out 
happens. That is the shuffling procedure in the cells is re- 
peated without changing the conditions repeatedly. 
25 In a number of DNA structures, such as in many Bacillus 

DNA structures (vectors) (Bacillus subtilis and Other Gram- 
Positive Bacteria, Sonensheim et al. ; 1993, American Society for 
Microbiology, Washington D.C.) the origin of replication com- 
prise a ori(+) and a ori (-) . ori(+) controls where replication 
30 of the first DNA strand starts, and ori(-) controls the ini- 
tiation point for replication op. the second strand (Bacillus 
subtilis and Other Gram-Positive Bacteria, Sonensheim et al., 
1993, American Society for Microbiology, Washington D.C). 

In an embodiment of the process of the invention, at 
35 least" one DNA structure does not comprise ori(-) (i.e. only com- 
prise ori(+)). Preferably the DNA structure which does not com- 
prise ori(-) is a DNA structure which origin (i.e. herein 
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ori(+)) is functional during the hybrid DNA structure formation 
performed according to the invention. 

One advantage of this embodiment of the invention is 
when only ori(+) is functional in said DNA structure, this DNA 
structure is in a single-stranded state for relatively longer 
period, compared to a DNA structure comprising both ori(+) and 
ori(-). This prolonged single-stranded state facilitate the hy- 
brid DNA structure formation according to a process of the in- 
vention. 

Preferably, the DNA structure which does not comprise 
ori(-) as described above is a Bacillus structure, more pre- 
ferred a Bacillus vector. 

In order to illustrate this concept, an example is a 
modified pTR (TR: temperature resistence) Bacillus vector which 
does not comprise ori(-) (see figure 1). If the hybrid formation 
is performed at 50°C only the origin of pTR is functional 
(origin of pTS (TS: temperature sensitive) is not functional) 
(see fig. 1,2). The DNA structure pTR only comprise ori(+) and 
is therefore in a single-stranded state for relatively longer 
3 period, which facilitate the recombination event and conse- 
quently hybrid formation with the pTS DNA structure (see fig. 
1,2) . 

It is more preferred that one DNA structure is a vector 
which is able to replicate under certain (permissive) condition's 

5 and unable to replicate under other (non-permissive) conditions'. 
The vector may, for instance, be one which is temperature- 
sensitive for replication. Thus, the vector may be one which is 
unable to replicate at increased temperatures, which yet permit 
growth of the parent cell. The cell is initially incubated at a 

o temperature permitting vector replication and subsequently after 
integration into the other DNA structure may have taken place, 
incubated at a temperature which does not permit vector repli- 
cation so that the vector is lost from the cells unless inte- 
grated . 

5 The vector may further comprise a selectable marker. In 

this case, the incubation at the non-permissive temperature may 
be conducted under selective conditions to ensure that only 
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cells containing the integrated vector (which includes the DNA 
sequence and the selectable marker), will survive. 

Further, it is preferred to construct a system where it 
is possible in vivo to measure formation of the hybrid struc- 
5 ture. 

Preferably, this is done by constructing a system 
wherein an in vivo screenable or selectable protein is expressed 
only after the hybrid structure has been formed. 

Preferably, said screenable protein is a protein which 
10 is fluorescent under suitable exposure, and in particular said 
fluorescent protein is Green Florescent protein (GFP) or vari- 
ants thereof. 

Green Florescent protein (GFP) or variants thereof and hov 
the GFP's are activated to fluoresce are described in the art 

15 and reference is made to (Crameri et al. Nature Biotechnology 

14:315-319 (1996); and Cormack et al. GENE 173:33-38 (1996)) for 
further details. 

One of the advantages of using such a screenable protein 
which is fluorescent under suitable exposure of light, and in 

20 particular GFP or variants thereof, is that it is then possible 
by e.g. fluorescent-activated cell sorting (FACS) technology to 
selectively sort out. cells which comprise said hybrid structure. 
This nay eliminate background of cells, which do not comprise 
said hybrid structure after performing of step a) according to 

25 the first aspect of the invention. 

FACS is a widely known screening technology and reference 
is made to (Cormack et al. " FACS- optimized mutants of the green 
fluorescent protein (GFP)" GENE 173:33-38 (1996)) for further 
details. 

In order to illustrate a suitable system enabling in 
vivo measurement of the formation of the hybrid structure, an 
example of a suitable system is shown in figure 1 and 2. 

Alternatively to the use of a screenable protein as dis- 
cussed above, it may be advantageously to use a selectable 
35 marker, wherein said selectable marker is only transcriptionally 
active when the hybrid structure is formed. 
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Further, it is preferred to that the crossing cut from 
the hybrid DNA structure, in "step b) of the first aspect of the 
invention, is a forced crossing out event. 

Forced crossing out from the hybrid structure may be 
s performed by: 

i) constructing a system wherein the DNA hybrid structure 

does not express a selectable marker, and at least one 
of the at least two DNA structures in step a) of first 
aspect is able to express said selectable marker; 
10 ii) incubating the cell under selectable conditions, 

thereby forcing a cross out of the DNA hybrid struc- 
ture, since the cell is- only able to express the 
marker gene after the crossing out of the DNA hybrid 
structure has taken place. 
15 Preferably, selectable marker gene mentioned above is 

expressed on a DNA structure where the origin of replication is 
non-functional during a forced integration of the DNA structures 
into a DNA hybrid structure performed as described above, and 
preferably the selectable marker gene is an antibiotic resis- 
20 tance gene. See below for further description of suitable se- 
lectable marker genes. 

In order to illustrate the concept of forced crossing 
out from the hybrid DNA structure, an example of a suitable sys- 
tem is shown in figure2 1 and 2. 
25 The selectable marker may be any marker known in the art, 

for instance a gene coding for a product which confers anti- 
biotic resistance to the cell, which confers prototrophy to an 
auxotrophic strain, or which complements a defect of the host, 
(e.g. dal genes introduced in a dal- strain; cf. B. Diderichsen 
30 (1986) , Bacillus: Molecular Genetics and Biotechnology Applica- 
tions, A.T. Ganesan and J. A. Hoch, Eds., Academic Press, pp. 35- 
46. The selectable marker may, e.g., be excised from a known 
source or present on a vector, e.g. a plasmid, used for the con- 
struction of the DNA structure to be used in the method of the 
35 invention. 

This selection may be accomplished by growing the cells 
under selection pressure for the selection marker encoded by the 
marker gene. 
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The DNA structure (in which the vector is to be 
integrated) may contain a gene encoding any selectable marker, 
e.g. of any type as described above in connection with the 
marker optionally carried by the vector to be used in the method 
of the invention. Thus, the marker gene may encode an antibiotic 
resistance such as resistance to kanamycin, tetracyclic 
ampicillin, erythromycin, chloramphenicol, or a resistance to 
various heavy metals such as selenate, antimony or arsenate. 

In a preferred embodiment, the cell used in the process 
is defective or transitorily inactivated in the enzymatic mis- 
match repair system or has a reduced level of mismatch repair. 
In the synthesis of DNA , errors can occur and the resulting non- 
complementary pairs of bases are called mismatches. A process 
for the correction of errors (mismatches) in DNA exists. In E. 
coli the errors are very rapidly and accurately detected by two 
enzymes (MutS and MutL) enabling a third enzyme (MutU) to unwind 
the two DNA strands and a fourth enzyme (MutH) to cut the neo- 
synthesized strand on a sequence of DNA (GATC) itself methylated 
later. For a closer explanation, reference is made to EP 449923 
(Setratech) , the contents of which hereby are incorporated by 
reference . 

In another preferred embodiment the cell used in the 
process is impaired or hampered in the major recombination en- 
zymes such as AddAB . It has been shown that a strain impaired in 
5 the AddAB genes shows a higher frequency of homologous recombi- 
nation. Both legitimate and illegitimate recombination frequency 
are increased. But especially the recombination frequency for 
illegitimate recombination is favoured when working with a 
strain impaired or hampered in the AddAB ATP-dependent nuclease 
o (Meima, R ; Haijema, BJ; Dijkstra, H; Haan, GJ; Venema, G; Bron, 
S (1997) Role of enzymes of homologous recombination in ille- 
gitimate plasmid recombination in Bacillus-subtilis . JOURNAL OF 
BACTERIOLOGY Vol. 179 , No. 4 pp. 1219-1229.) 

In another preferred enbodiir.ent the cell used in the 
5 process is over-expressing one of the key recombination enzymes, 
recA. The recA enzyme is involved in homologous recombination by 
promoting renaturation of ssDNA to form the heteroduplex molecules 
necessary for the recombination event and initiates the exchange 
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of- DNA strands. (PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR 
BIOLOGY Vol. 56 , pp. 129-223 " (1997) ) . The recA over-expression 
can be achieved by either classical mutagenesis of the host strain 
or cloning of a highly expressed recA gene. 
5 In another preferred embodiment the cell used in the 

process is subjected to UV, X-ray or DNA damaging agents to 
increase recombination frequency between the two homologous genes 
to be shuffled. Stress like UV, X-ray or DNA damaging agents is 
known to induce the recombination enzymes via the SOS response and 
10 activate enzymes involved in recombination processes like recA. 
(BACILLUS SUBTILIS AND OTHER GRAM-POSITIVE BACTERIA, pp. 529-537 
(1993)). 

In another preferred embodiment the pTR based plasmid used 
as a carrier of one of the genes to be shuffled is replaced by 

15 either a low-copy version of pTR or another B. subtilis compatible 
low-copy plasmid like pWVOl, (MOLECULAR AND GENERAL GENETICS Vol-._ 
249 , No. 1 pp. 43-50 (1995 )), pTAIOSO, (JOURNAL OF BACTERIOLOGY 
Vol. 142 , No. 1 pp. 315-8 (1930)) or pAMbl , (JOURNAL OF 
BACTERIOLOGY Vol. 157 , No. 2 pp. 445-53 (1984)). 

20 In a further preferred embodiment the one of DNA construct 

comprising one the genes to be shuffled is situated on the 
chromosome of the cell and another DNA construct, comprising 
another gene of interest to be recombined is a plasmid. When 
performing an in vivo recombination process of the invention the 

25 plasmid will be recombined with the DNA construct on the 

chromosome. Said plasmid may e.g. be a temperature sensitive 
plasmid. 

The advantages of using a low-copy pTR, or a system where 
one of the DNA constructs is situated on the chromosome as 

30 described above, is that the hybrid DNA construct after 

recombination of the two homologous genes will be the predominant 
form of the genes in the cell. A high-copy background of non- 
recombined pTR plasmids will inevitably reverse the gene hybrids 
formed in the first round back to wildtype after the second round 

35 of recombination and therefore prohibit any shuffling of the two 
genes. When using the chromosome as a carrier of one of the genes 
no wildtype background can coexist when the pTS has integrated via 
the homologous genes. 
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It is preferred that the cell used in the process is a cell 
suitable for large scale production of the product of the 
recombined DNA sequence. It is preferred that the cell is a 
bacterial cell, more preferred a Bacillus cell, most preferred a 
5 Bacillus subtilis cell. 

The DNA sequences to be recombined may be isolated/cloned 
from organisms containing the sequences in a manner known per se. 
For instance, a suitable oligonucleotide probe may be prepared 
on the basis of a first DNA sequence of interest in order to 
10 obtain a second DNA sequence homologous to the first sequence. 
Alternatively, the DNA sequence can be amplified by PCR based on 
primers designed according to knowledge of conserved regions. 

I different embodiments of the present content is used 
the term "a set of homologous DNA sequences ("set A" "set B" , 
"15 "setC" etc.)"/ e.g. in relation to an embodiment of the 
'"*:: invention relating to a process according to the invention, 
wherein the homologous DNA sequences are comprised of two or 
more different set of homologous DNA sequences ("set A" "set B", 
"setC" etc.), and wherein each of said set of homologous 
20 sequences ("set A" "set B", "setC" etc.) are inserted in a 

different DNA construct ("DNA construct A", "DNA construct B", 
"DNA construct C" ere.) according to the invention. 

This term is used to illustrate that an in vivo 
recombination process of the present invention may be used to 
25 recombine numerous "sets" of homologous DNA sequences. Each 

individual "set" of homologous DNA sequence (s) may individually 
comprise from only one to numerous (e.g. a "set" comprising a 
DNA library) individual homologous DNA sequence (s) . 

Each of said "sets" of homologous DNA sequences ("set A" 
30 "set B" , "setC" etc.) may then be inserted in a different DNA 
construct ("DNA construct A", "DNA construct B" , "DNA construct 
C" etc.) according to the invention. Each of said different DNA 
construct are preferably different on important parameters, 
according to the invention, such as different origin of 
35 replication, different selectable markers etc. 

All of said DNA constructs ("DNA construct A", "DNA 
construct B", "DNA construct c" etc..) may then be transferred 
into a cell, according to the invention, and when performing a 
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process for in vivo recombination of the invention, then each of 
the individual DNA constructs ("DNA construct A" , "DNA construct 
B n , "DNA construct C" etc..) may then recombine randomly 
according to the invention. After a few repetitions of a 
5 process, of the invention, this will give a DNA population, in 
the cell population, comprising numerous different combinations 
of different recombined DNA sequences, recombined randomly 
between all "sets" of homologous DNA seguences used in the 
process . 

10 Further, the term "a set of homologous DNA seguences 

("set A" "set B" , "setC" etc.)" is used herein to illustrate one 
of the advantages of an in vivo recombination process of the 
invention, which is the flexibility of the system to recombine 
both set of homologous DNA sequences comprising one, few and/or 

15 numerous (e.g. big DNA libraries) of individual homologous DNA 
sequences . 

Accordingly, an embodiment of the invention relates to 
process according to the invention, wherein said sets of 
homologous DNA sequences each only contain one DNA sequence 
20 (i.e. recombination of two or more honologous sequences). 

Further in embodiments of the invention the homologous 
DNA sequences to be recombined are derived from a pre-made DNA 
library and/or derived from individually different pre-made 
libraries . 

25 Homologous DNA derived from such DNA libraries are 

inserted into a vector according to the invention (i.e. DNA re- 
presenting the DNA library is ligared (inserted) into a vector). 
As described herein the process of the invention comprise at 
least two DNA structures (e.g. vectors), which are different in 

30 a way which allow forced in vivo recombination of said vectors. 
Homologous DNA representing' DNA libraries may be 
inserted into only one and/or both of said different vectors. 

I.e. if for instance homologous DNA representing one DNA 
library is inserted (ligat'ed) into one of said different vectors 

35 and homologous DNA representing another DNA library is inserted 
into the other vector this will allow fast and efficient 
recombination between two different libraries by an in vivo 
recombination process according to the invention. To illustrate 
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this concept, DNA representing one library may be insprted into 
a pTR vector (see figure 1) a'nd DNA representing another library 
may be inserted into a pTS vector (see figure 1) . Just a few re- 
peating in vivo recombination events performed by a process ac- 
5 cording to the invention will then give a cell population 
representing a new library which is a combination of the two 
libraries mentioned above. 

Accordingly, embodiments of the invention relates to a 
process according to the invention, wherein at least one of the 
10 sets of homologous DNA sequences is derived from a pre-made DNA 
library, and at least one of said sets of homologous DNA 
sequences is a DNA sequence which . is homologous to said DNA 
library (i.e. resulting in a process for in vivo recombination 
of said DNA library to a DNA sequence which are homologous to 
15 said DNA library) ; or 

a process according to the invention, wherein the sets 
of homologous DNA sequences is derived from the same pre-nade 
DNA library (i.e. resulting in a process for in vivo 
recombination of said DNA library) ; or 
20 a process according to the invention, wherein the sets 

of homologous DNA sequences are derived from at least two 
different pre-nade DNA libraries (i.e. resulting in a process 
for in vivo recombination of at least two different DNA 
libraries) . 

s In order to obtain even further recombination between the 

DNA sequences it may be preferred to perform an isolation of the 
DNA structures after a first round of in vivo recombination of 
said DNA sequences performed according to the invention. Said 
isolated structures are then re-transformed into the cells and a 

o further process round of in vivo recombination is then 
performed . 

The advantages of this step is illustrated by following 
example : 

Set A is a vector A wherein DNA sequences a, b are 
5 introduced individually in said vector A (i.e. the vector A 
population comprises a vector with DNA sequence a, and a vector 
with sequence b) ; and 
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Set B is a vector B wherein DNA .sequences c, d are 
introduced individually in said vector B (i.e. the vector B 
population comprises a vector with DNA sequence c, and a vector 
with sequence d) . 
5 Vector A and B are then transformed into a cell 

population and in vivo recombined according to the invention. 
This will give recombination between DNA sequences ac, ad, be, 
bd. The vectors A, and B comprising said recombined sequences is 
isolated, and re-transformed into said cell population. In vivo 

10 recombination according to the invention will then give all 
possible recombination of said four (a,b,c,d) DNA sequences, 
such as recombination of abc, abdc, etc. 

In even further embodiments the invention relates to a 
process according to the invention, wherein two sets of homolo- 

15 gous DNA sequences ("set A" and "set B") are recombined in vivo, 
comprising the steps 

a) inserting sequence(s) fror. "set A" in a vector contain- 

ing a temperature resistant origin of replication and a 
gene for a first marker; 
20 b) inserting sequence(s) fro- "set B" in a vector contain- 

ing a temperature sensitive origin of replication and a 
gene for a second marker; 

c) transferring the vectors into a cell; 

d) Incubating the cell at a temperature where both origins 
25 of reolication are functional, optionally under a selec- 
tion pressure that favours cells with both marker genes; 

e) shifting the temperature to a temperature, where only 
the temperature resistant origin of replication is func- 
tional ; 

30 f) repeating steps d)-e) at least once; 



or 



a process according to the invention, wherein two sets 
35 of homologous DNA sequences ("set A" and "set B") are recombined 
in vivo, comprising the steps 
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a) ■ inserting sequence (s) from "set A n in a vector contain- 

ing a temperature resistant origin of replication and a 
gene for a first marker; 

b) inserting sequence(s) from "set B" in. a vector contain- 
ing a temperature sensitive origin of replication and a 
gene for a second marker; 

c) transferring the vectors into a cell; 

d) Incubating the cell at a temperature where only the tem- 
perature resistant origin of replication is functional, 
optionally under a selection pressure that favours cells 
with both marker genes; 

e) optionally, shifting the temperature to a temperature, 
where both origins of replication are functional; 

f) repeating steps d)-e) at leair: once, 

In a further aspect the invention rel3t.es to a method for 
producing one or more recombinant prttein(s) having a desired 
biological activity comprising: 

a) performing a process for in vivo recombination of homo- 
logous DNA sequences (the process is performed according 
to the invention as described herein) , wherein at least 
one of the DNA structure (s) is an expression vector; 

b) expressing the numerous different recombinant polypeptides 
encoded by the numerous different recombined DNA sequences 
from step a) ; and 

c) screen or select the numerous different recombinant 
proteins from step b) in a suitable screening or selection 
system for one or more recombinant polypeptides (s) having 
a desired activity. 

Preferably, the expression vector in step a) above is a DNA 
structure which is unable to replicate (e.g. having a non- 
functional origin of replication) during a forced hybrid DNA 
structure formation (i.e. integration of said structures into 
each other) performed according to a process for in vivo 
recombination of homologous DNA sequences according to the 
invention. This will improve the probability that the expression 
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vector, after a crossing-out event of said hybrid-structure 
according to the invention, comprise recombined homologous DNA, 
since the only way said expression vector may have survived the 
process of forced hybrid formation is by actually recombine with 
5 another DNA structure, which is able to replicate under the 
conditions used to force said hybrid-structure formation • 

After the process for in vivo recombination have been 
performed (step a) above) , and before the expression and 
screening steps b) and c) above, it may be advantageously to 

ao make a cell population mainly comprising only the expression 
vector (i.e. other DNA construct (s) are partly or nearly 
completely removed from the cell -population) . This is particular 
advantageously when the expression vector is unable to replicate 
during the forced hybrid DNA structure formation as described 

15 immediately above, since such a cell population will have a very 
limited background of non-recombined DNA sequences of interest, 
which is advantageously in the subsequent screening step (step 
c) above) . 

Making a cell population mainly comprising only the 
20 expression vector may be performed according to standard pro- t . 
cedures known in the art, such as growing the cell population at 
conditions selectively favouring replication of the expression 
vector, as compared to the other DNA structures; or selectively 
purify said expression vector form the cells and subsequently 
25 transfer this into a new cell population. 

Expression the recombinant polypeptides encoded by the 
recombined sequence in step a) above nay be performed by use of 
standard expression vectors, and/or a standard expression vector 
nay be modified in order to comprise elements specially suitable 
30 for a in vivo recombination process according to the invention. 

A suitable screening or selection system, to screen or se- 
lect the numerous different recombinant proteins from step b) 
above, will depend on the desired biological activity. 

A number of suitable screening or selection systems to 
35 screen or select for a desired biological activity are described 
in the art. Examples are: 
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Strauberg et al. (Biotechnology 13: 669-673 (1995), which 
describe a screening system to screen for Subtilisin variants hav- 
ing a Calcium-independent stability; 

Bryan et al. (Proteins 1:326-334 (1986)), which describe a 
5 screening assay to screen for Protease having a enhanced thermal 
stability; and 

PCT-DK96/00322 which describe a screening assay to screen 
for lipases which are having an improved wash performance in wash- 
ing detergents, 

10 a preferred embodiment of the invention comprise screening 

or selection of recombinant protein(s), wherein the desired bio- 
logical activity is improved performance in dish-washing or laun- 
dry detergents. Examples of suitable dish-wash or laundry deter- 
gent are disclosed in PCT/DK96/00322 and WO 95/30011. 

15 The invention also comprises a set (at least two) of 

vectors, comprising at least two different origins of replica- 
tion able to function in the sane cell under permissive con- 
ditions, and at least two homologous DNA sequences encoding 
polypeptides. In a preferred embodiment, at least one origin of 

20 replication is temperature sensitive. The vector may preferable 
be a plasmid. 

The invention 'further comprises a vector system, com- 
prising at least 

a) a vector A containing a DNA sequence encoding a polypep- 
25 tide and an origin of replication, and 

b) a vector B containing a DNA. sequence homologous to the 
DNA sequence in vector A, and an origin of replication 
that differs from the origin of replication in vector A. 
The vectors may preferable be plasmids. 

30 

It is preferred that the DNA sequence encodes polypep- 
tides with a biological activity, especially enzym.es and in par- 
ticular enzymes such as an amylase, lipase, cutinase, cellulase, 
oxidase, phytase, and a protease. 
35 The invention further comprises a new, recombined DNA 

sequence, characterised in that has be produced by a process ac 
cording to the invention, or it is encoding a recombinant pro- 
tein having a desired biological activity, wherein said recombi 
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nant protein having a desired biological is produced by the 
method according to method for producing one or more recombinant 
protein (s) having a desired biological activity according to the 
invention. 

5 The invention further comprises a process for producing 

a polypeptide, especially an enzyme, comprising i) growing a 
cell expressing a polypeptide encoded by a recombined DNA se- 
quence according to claim 38, and ii) purifying said expressed 
polypeptide. 

10 The invention further comprises a new polypeptide, espe- 

cially an enzyme, characterised in that it is produced by a 
process according to the invention. 

The invention further comprises a polypeptide, espe- 
cially an enzyme, characterised in that it is encoded of a re- 
15 combined DNA sequence according to the invention. 

Polypeptides of the present invention include peptides and 
proteins, and optionally glycosylated forms thereof. Examples are 
enzymes, pharmaceutical^ active polypeptides such as analogues 
of human or mammal polypeptides, e.g. insulin, growth hormone, 
20 human hormones or growth regulators. 

The invention is described in further detail in the 
following examples which are not in any way intended to limit the 
scope of the invention. 

2 5 EXAMPLES 

MATERIALS AND METHODS 

Strains . 

30 E.coli: SJ2 (Diderichsen, B., Wedsted, U., Hedegaard, L. , 

Jensen, B. R. , Sjoholm, C. (1990) Cloning of aldB, which encodes 
alpha-acetolactate decarboxylase, an exoenzyme from Bacillus 
brevis. J. Bacteriol., 172, 4315-4321) Electrocompetent cells 
prepared and transformed using a Bio-Rad GenePulser™ as 

35 recommended by the manufacturer. 

B.subtilis DN1885 Diderichsen, B., Wedsted, U. , Hedegaard, L. , 
Jensen, B. R. , sjoholm, C. (1990) Cloning of aldB, which encodes 
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alpha-acetolactate decarboxylase, an exoen 2ym e from Bacillus 
trevis. J. Bacterid., 172, 4315-4321). Competent cells were pre- 
pared and transformed as described by Yashin, R.E., Wi i son G A 
and Young, F.E. (1975) Transformation and transfection in ly- ' " 
s sogenic strains of Bacillus suttilis : evidence for selective in- 
duction of prophage in competent cells. J. Bacterid, 121:296-304. 

B .suttilis PU801. This strain is the B.subtilis DN1885 with 
disrupted apr and npr genes (Diderichsen, B., Wedsted u 
io Hedegaard, L. , Je nsen, B . R. , sjoholm, c. (1M0 , cionIng"of 
aldB, which encodes alpha-acetolactate decarboxylase an 
exoenzyme from Bacillus trevis. J, Bacteriol., 172, 4315-4321). 

Plasmids : 

» PSX120: 3. suttilis expression vector (Described in EP 405 90 , 

-a „o 91/ oo34s,. This vector co , ?rise the vi - 

gene . 

PE194: (Horinouchi, s and Weisbli- 3 igs? 7 o a - • , 
150:804-814). 

=0 PSX222: B. su^ilis e x press 10 n vector (Described in WO 96/ 3 4946 
ana „o 92/19729) . This vect or _ prise the synthat . c ' - 
gene herein caHed Savisyn. In the Savisyn 9e „e is introduced a 
large „ um ber o, restriction sites as compared to the vild-tvp. 

Savmase gene comprised in pSX120. 

25 

PUC19: Yanisch-Perron, C ^>i&>~\x - w 

Gene 33, 103 -ii 9 . ' " ^ M6SSlng ' J " (1985 > 

PF64L-S65T-GFP: is the E.coli plasnid encoding the DNA 
30 encoding the F64L-S65T-GFP described in WO 97/11094. 

PMUTIN-4-MCS: Plasmid can be obtained from Laboratoire de Geneti- 
que Microbienne, Institut National de la Recherche Agronoaique 

78352. Jouy en Josas - CEDEX, France. 

35 

Solutions/Media 
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TY and LB agar (as described in Ausubel, F. M. et al. 
(eds.) "Current protocols in Molecular Biology". John Wiley and 
Sons, 1995). 

LBPG is LB agar supplemented with 0.5% Glucose and 0.05 M 
5 potassium phosphate, pH 7.0 

LBPGSK plates are LBPG media with agar and 1% of skimmed 

milk. 

Antibiotics are used in different medias in concentrations 
of 6 |ig/ml Chloramphenicol, 10 |ig/ml Kanamycine and 5 ug/ml og 

10 Erythromycine. 

TE-buffer (as described in Ausubel, F. M. et al. (eds.) 
"Current protocols in Molecular 3iology". John Wiley and Sons, 
1995) . 

15 General molecular biology methods 

DNA manipulations and transformations were performed using 
standard methods of molecular biolcgy (Sambrook et al. (1989) 
Molecular cloning: A laboratory manual, Cold Spring .Harbor lab., 
Cold Spring Harbor, NY; Ausubel, F. M. et al. (eds.) "Current 
20 protocols in Molecular Biology". Jchn Wiley and Sons, 1995; 

Harwood, C. R . , and Cutting, S. M . (eds.) "Molecular Biological 
Methods for Bacillus". John Wiley and Sons, 1990). 

Enzymes for DNA manipulations vere used according to the 
specifications of the suppliers. 

25 

Enzymes for DNA manipulations 

Unless otherwise mentioned all enzymes for DNA 
manipulations, such as e.g. restriction endonucleases , ligases, 
polymerases, PGR polymerases etc., are obtained from New England 
30 Biolabs,, Inc. 

Construction of the temperature sensitive shuffling pl asmid 

pMB4 3 0 was constructed in a five step cloning process as 
depitched in the following, the entire DNA sequence is found in 
35 SEQ ID No. 1. 
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The temperature sensitive plasmid pE194 was modified by intro- 
ducing multiple cloning sites around the unique Clal site of the 
pE194 plasmid and thus destroying the Clal site. This introduc- 
tion was done by PCR using the primers #22899 and #24039 to am- 
5 plify the whole plasmid and introducing several unique cloning 
sites , 

The PCR was performed as follows: 

1 fil of pE194 Qiaquick plasmid prep was used for 50 pi 
PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM 
o KCl, 1.5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 200 of 
each dNTP, 2.5 units of AmpliTag polymerase (Perkin-Elmer , 
Cetus, USA) and 100 pmol of each-pirimer: 
#22899 

5 1 -GCAGCTGGATCCGCGGCCGCGAATTCG7CTTTTGCGCAGTCGGC-3 1 

5 

#24039 

5 ' -GCAGCTGG AT CCGGGCCCGGG AAG C TT C 3 A TT C A C AAAAAA T AG GCACACG-3 ' 

The PCR reactions were performed using a DNA thermal 
o cycler (Landgraf, Germany). One incubation at 94°C for 1 min 
followed by forty cycles of PC?, performed using a cycle profile 
of denaturation at 94°C for 1 min, annealing at 55°C for 1 min, 
and extension at 72°C for 2 min. Five-ul aliquots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % 
5 agarose gels (NuSieve, FMC; . 

Subcloninq of PCR fragment 

Forty-f ive-fsl aliquots of the PCR products generated as 
described above were purified using QIAquick PCR purification 

o kit (Qiagen, USA) according to the manufacturer's instructions. 
The purified DNA was eluted in 50 ^1 of lOmM Tris-HCl, pH 8.5. 
Fifty -/il of the purified PCR fragment "was digested with BamHI, 
electrophoresed in 0.8 % low gelling temperature agarose 
(SeaPlaque GTG , FMC) gels, the relevant fragment was excised 

5 from the gels, and purified using QIAquick Gel extraction Kit 
(Qiagen, USA) according to the manufacturer's instructions. The 
isolated DNA fragment was then ligated to BamHI digested pUC19 
and the ligation mixture was used to transform E. coli SJ2 . 
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Cells were plated on LB agar plates containing Erythro- 
mycin (200 Mg/Nl) supplemented with X-gal (SIGMA, USA) (5-Bromo- 
4-chloro-3-indolyl alpha-D-galactopyranoside, 50 /ig/ml) . 

5 identification and characterisation of positive clones 

The transformed cells were plated on LB agar plates 
containing Erythromycin (200 /zg/ml) supplemented with X-gal (50 
/jg/ml) and incubated at 37°C over night. Next day white colonies 
were rescued by restreaking these onto fresh LB- Erythromycin 

10 agar plates and incubated at 37°C over night. The next day 
single colonies of each clone were transferred to liquid LB 
medium containing Erythromycin (200 ' /ig/ml) and incubated 
overnight at 37°C with shaking at 250 rpm. 

Plasmids were extracted from the liquid cultures using 

15 QIAgen Plasmid Purification mini kit (Qiagen, USA) according to 
the manufacturer's instructions. Five-/U samples of the plasmids 
were digested with BamHI . The digestion were checked by 
gelelectrophoresis on a 0.7 % agarose gel (NuSieve, FMC) . The 
appearance of two DNA fragments of about 2.7 kb and 3.7 

20 indicated a positive clone. The clone was designated MB293 and, 
the plasmid pMB293. 

2) 

PMB253 was then EcoRI digested and the 3.8 kb fragment was 
25 gelpurified on a 0 . 7 % agarose gel and using QiaQuick gelpurifi- 
cation kit from Qiagen according to the manufacturer. This frag- 
ment which constitutes the pE194 with inserted multiple cloning- 
sites (originating from the PCR primers and from MCS of pUC19) 
was circulised by ligation and used to transform B.subtilis 
30 DN1885, transformed cells were selected on LBPG plates contain- 
ing 5 V>q/™1 Erythromycin and incubation was done at 33 °C over- 
night. Clones from this transformation were restreaked and cul- 
tivated overnight in TY with 5 fig/ml Erythromycin. From over- 
night- culture broth cells were isolated and plasmids were puri- 
35 fied using Qiaquick plasmid kit. Restriction enzyme digestion of 
the plasmids verified the correctness of the clones. One such 
clone and plasmid were termed MB333 an pMB333. 
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3) 

The pMB333 was then used as the temperature sensitive plasmid 
for the construction of a plasmid containing the the Savizyn 
gene without any promoter. This construct was made as follows. 

PCR using the primers #21547 and #21548 and the pSX222 as 
the template was performed according to these conditions: 

1 pi of pSX222 Qiaquick plasmid prep was used for 50 pi 
10 PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM 
KC1, 1.5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 200 pM of 
each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elmer , 
Cetus, USA) and 100 pmol of each primer: 
#21547 

15 5 1 -GCAGCTGCAGGATCCGAATTCGCGGCCGCGCAATCGGTACCATGG-3 
#21548 

5 » -GCAGCGAGCTCAAGCTTCCCGGGCCCAGCCGGTCGACCGCGTTGCCGCTTCTGCG-3 ' 

The PCR reactions was performed using a DNA thermal 
20 cycler (Landgraf, Germany). One incubation at 94°C for 1 min 
followed by forty cycles of PCR performed using a cycle profile 
of denaturation at 94°C for 1 rain, annealing at 55°C for 1 min, 
and extension at 72 °C for 2 min. Five-pl aliquots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % 
25 agarose gels (NuSieve, FMC) . 

Snbclonino of PCR fragment 

Fortyfive-pl aliquots of the PCR products generated as 
described above were purified using QIAquick PCR purification 
30 kit (Qiagen, USA) according to the manufacturer's instructions. 
The purified DNA was eluted in 50 pi of lOmM Tris-HCl, pH 8.5. 
Fifty -pi of the purified PCR fragment was digested with BamHI 
and Apal, electrophoresed in 0.8 % low gelling temperature 
agarose (SeaPlaque GTG , FMC) gels, the relevant fragment was 
35 excised from the gels, and purified using QIAquick Gel 

extraction Kit (Qiagen, USA) according to the manufacturer's 
instructions. Purified fragment was ligated to gelpurified BamHI 
and Apal digested pMB333. The ligation was used to transform 
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B". subtil is DN1885. Transformed cells .were plated on LBPG agar 
plates containing 5 /xg/ml Erythromycin and incubated overnight 
at 33 °C. Clones were restreaked on similar plates and plasmids 
were isolated from overnight cultures of clones incubated at 
5 33 °C in TY with 5 ^ig/ml Erytrhomycin plasmids were isolated 
using Qiaguick plasmid kit as described by manufacturer, 
Plasmids were checked by enzymatic digestion using Apal and 
BamHI. One correct clone was termed MB339 and the plasmid 
pMB339. 

0 

4) 

We wanted to test an easy way of -monitoring that hybrid plasmids 
had formed. Thus we wanted to monitor the forced recombination 
between the two homologous genes on the two seperate plasmids. 

15 This was done by constructing a pE194 derivative having a pro- 
moter-lacking Savizyn transcr iopt ionally fused to an ORF encod- 
ing GFP . In this way there would be no expression of GF? (due to 
lack of promoter upstream of the Savizyn and thus the GFP gene) 
unless the two homologous genes had recombined and thus getting 

20 the GFP under control of the promoter otherwise present on 
pSX120. The plasmid was constructed by PGR amplifying the GFP 
gene and introducing it just downstream of the savizyn gene on 
the pM333 9. 

The GFP encoding DNA used herein is a derivative of 
25 the original wildtype gene, the derivative of GFP was cloned 
from the DNA construction of the mutant F64L-S65T-GFP which was 
constructed as described in international patent application WO 
97/11094. 

The construction of the template plasmid was de- 
30 scribed in VJO 97/11094. The DNA fragment encoding the F64L-S65T- 
GF? was PCR amplifyed from the E.coli plasmid encoding this gene 
as follows: 

1 pi of pF64L-S65T-GF? Qiaquick plasmid prep was used 
for -50 pi PCR amplification in PCR buffer (10 mM Tris-HCl, pH 
35 8.3, 50 mM KC1, 1.5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 
200 jzM of each dNTP, 2.5 units of AmpliTaq polymerase (Per- 
kin-Elmer, Cetus, USA) and 100 pmol of each primer: 
#20231 
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5 ■ -AACTGCAGAAGATGTGGACGCGC-3 1 
#101381 

5 ' -GCCCAATGCATAAACTGCATCCCTTAACTTGTTTTATTTGTATAGTTCATCCATGCCATG- 
3 » 

5 

The PCR reactions was performed using a DNA thermal 
cycler (Landgraf , Germany) . One incubation at 94°C for 1 min 
followed by forty cycles of PCR performed using a cycle profile 
of denaturation at 94°C for 1 min, annealing at 55°C for 1 min, 
10 and extension at 72 °C for 2 min. Five-^1 aliguots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % 
agarose gels (NuSieve, FMC) . . ' 

Subclonincr of PCR fragment 

15 Forty-five-^1 aliquots of the PCR products generated as 

described above were purified using QIAquick PCR purification 
kit (Qiagen, USA) according to the manufacturer's instructions. 
The purified DNA was eluted in 50 -1 of lOmM Tris-HCl, pH 8.5. 
The PCR fragment and pMB339 were digested with Clal and Nsil, 

20 gelpurified as above and ligated together. The ligation mixture 
was used to transform DN1885, transformed cells were plated on 
LBPG 5 /ig/ml Erythromycin. After IS hours of incubation green 
flourescent cells were visible as single colonies. One of these 
were analysed and saved as MB4 06 and the corresponding plasmid as 

25 pM340S. Since the GFP encoding ORF did not have an upstream pro- 
moter driving the transcription a possible explanation for the GFP 
expression could be a read through from a promoter further up- 
straem of the Savizyn-GFP transcriptional fusion, namely the pro- 
moter of the repF gene. 

30 

5) 

In order two abolish any read-through into the ORF of GFP, a tran- 
scriptional terminator was inserted between the repF gene and the 
Savizyn DNA. The terminator was amplified from the pMUTIN-4-MCS 
35 plasmid using the following primers and conditions: 

1 /il of pMUTIN-4-MCS Qiaquick plasmid prep was used for 50 /U 
PCR amplification in PCR buffer (io mM Tris-HCl, pH 8.3, 50 mM 
KC1, 1.5 mM MgCl 2/ 0.01 % (w/v) gelatin) containing 200 /zM of 
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each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elmer , 

Cetus, USA) and 100 pmol of each primer: 

#102588 

5 1 -CGGACGGTACCGGTAATGACTCTCTAGCTTGAGGC-3 ■ 
5 #102589 

5 1 -GGGTACAGATCTCCGCGGCGCAAAAAGGCCATCCGTCAGG-3 ' 

The PCR reactions was performed using a DNA thermal cycler 
(Landgraf , Germany) . One incubation at 94°C for 60 sec followed 
by forty cycles of PCR performed using a cycle profile of dena- 

10 turation at 94°C for 30 sec, annealing at 55°C for 30 min, and 
extension at 72 °C for 1 min. Five-^1 aliquots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % aga- 
rose gels (NuSieve, FMC) . 

The amplified DNA was purified as above and cloned as a Kpnl- 

15 Bglll fragment in pMB4 06 also Bglll and Kpnl digested (same proce- 
dure for gelpurifying etc. as decribed above) . The ligation mix- 
ture was used to transform DN18S5, transformed cells were plated 
on LBPG 5 Mg/rcl Erythromycin. After 13 hours of incubation cells 
were visible as single colonies. One of these were ananlysed and 

20 saved as MB4 3 0 and correspondingly the plasmid pMB43 0. This clone 
had absolutely no expression of GF? indicating that the cloned 
terminator in fact worked as anticipated. 

The resulting sequence of the entire plasmid pMB4 3 0 is listed as 
SEQ NO 1. 

25 

Establishing the K34 3 2 strain 

PL1801 Bacillus subtilis was transf ormed with the two plasmids 
pMB430 and pSX120 using approximately 1 jig of each plasmid per 100 
Ml of competent PL1S01. The transformed cells were plated onto 

30 LBPG-5 /ig/ml Erythromycin, incubated overnight at 33 °C. Next day 
colonies were restreaked onto L3PG-5 pg/ml Erythromycin + 6 iig/ml 
Chloramphenicol and incubated overnight at 33 °c. Clones were also 
grown i liquid culture TY- 5 pg/ml Erythromycin + 6 fig f ml Chloram- 
phenicol overnight at 33°C, plasmids were purified from culture 

35 broth and the existence of both pMB4 3 0 and pSX120 in the PL18 01 
strain was verified. This clone was saved and termed MB432. 
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EXAMPLE 1 



Sequence shuffling of two homologous sequences both coding for 
the subtilisin gene of Savinase 

5 The Savinase wt gene (comprised in pSX120) coding for a 

protease of the subtilisin type was chosen as the target for 
shuffling of highly homologous genes. A site-directed variant of 
the Savinase gene (hereafter called Savizyn, comprised in 
pSX222, where a large number of silent mutations giving new re- 

10 striction sites are introduced, was used as the shuffling part- 
ner to Savinase. The two genes were cloned on two separate com- 
patible plasmids according to the invention. A transcriptionally 
active Savinase gene was cloned on a plasmid based on the pUBHO 
origin coding for resistance to chloramphenicol (pSX120, fig. 

15 4). A promoter less Savizyn gene was cloned on a derivative of 
the temperature sensitive plasmid pE194 which also codes for re- 
sistance to erythromycin (pM3430) . Two transcription terminators 
were cloned upstream of the Savizyn gene to prevent any read- 
through. 

20 In pMB430 the green florescence protein (GYP) was tran- 

scriptionally fused to the Savizyn gene in order to be able to 
visually follow the in-cross and out-cross events between the 
two plasmids. 

B.subtilis strain K3432, pLISOl transformed with the two 
25 plasmids pSX120 and pM34 30, were taken through two cycles of 
temperature shifts to induce two consecutive double-cross over 
events between the homologous genes. Homologous recombination 
and plasmid hybrid formation was selected on LBPG plates at the 
restrictive temperature (47-50°C) in the presence of erythromy- 
30 cin. Under these restrictive conditions, the temperature sensi- 
tive plasmid pM3430 is forced into the stable pSX120 plasmid by 
homologous recombination in the region of Savinase/Savizyn . 
These colonies where recombination has occurred also becomes 
fluorescent as a result of the recombination between the two 
35 plasmids and concomitant activation of the Savizyn-GFP operon 
(fig. 2) . Out-cross events were screened by growing the clones 
in LB-broth at the permissive temperature of 30°C for two days 
and plating the cells on LBPG plate with chloramphenicol and 
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erythromycin. The non-fluorescent colonies were restreaJced on 
plates and taken through a se'cond round of temperature cycle as 
described above. 

After the second round of shuffling, plasmid preparations 
5 were made from 20 different clones. Competent PL1801 wa§ trans- 
formed with each of the 2 0 plasmid preps and plated on LBPG sup- 
plemented with erythromycin. The plates were replicated onto 
LBPG plates with chloramphenicol and erythromycin to identify 
unwanted clones where both pSX120 and pMB430 had been trans- 

10 formed. Clones where only the pMB430 plasmid had been trans- 
formed were selected for sequencing of the Savizyn region to 
identify if shuffling had taken place. It is evident from figure 
6 that recombination between the two genes Savinase and Savizyn 
had taken place. All of the 12 sequenced clones show sose degree 

15 of recombination and 4 of these can positively be identified • as 
being result of two consecutive double recombination events. It 
is important to note that exchange of Savinase and Savizyn se- , 
quence can only be observed if the shuffled fragment overlaps a 
restriction site that is unique for Savizyn. The four clones 

20 with a positive evidence for two double recombinations is there- 
fore the minimum number since the recombination could have oc- 
curred between two neighbouring restriction sites. 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmid pMB430 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1: 

CA7CAGTACA AGAAAGA7AC TGTA77TCA7 AA*ACACCAAC TGCAAGAAGT 7AAGGATCAC 60 

TTACAGAAGG CAAATAACCA GTTACAGAGT CGAA7ACAGC ATATGAGGTC TACGAAACCC 120 

TTTGATTATG AAAATGACCC TACAGGXTTG 77C7C7GG AC GTGAAGAGAC TGGTAGAAAG 180 

ATAT7AACTG C7GA7GAA77 7G rj\ C G C C 7 G CAAGAAACAA 7CTCTTCTGC AGAACGGA77 240 

CT7GA7GA77 ACGAAAA7A7 TAAGAGCACA GAC7A77ACA CAGAAAATCA AGAA7TAAAA 300 

AAACGTAGAG AGAGTT7GAA AGAACTAG7G AATACATGGA AAGAGGGGTA TCACGAAAAA 3 60 

AGXAAAGAGG TTAATAAATT AAAGCGAGAG AATGA7AG77 7GAATGAGCA GT7GAA7G7A 420 

TCAGAGAAAT TTCAAGC7AG TACAG7GAC7 77A7A7CG7G CTGCGAGGGC GAATTTCCC7 4 80 

GGGTTTGAGA AAGGGTTTAA TAGGCTTAAA GAGAAAT7CT T7AATGATTC CAAAT7TGAG 540 

CGTGTGGGAC AGTTTATGGA TGTTGTACAG GATAA7GTCC AGAAGGTCGA TAGAAAGCGT 600 

GAGAAACAGC GTACAGACGA TTTAGAGATG TAGAGGTACT TTTATGCCGA GAAAACTTTT 660 
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TGCGTGTGAC AGTCCTTAAA ATATACTTAG 
TAACTTTCGG TTGCAAAGCT CTAGGATTTT 
GAAATTGGAA TAAATGCGAA ATTTGAGATG 
TCTTAGATTT TTGGGGTTAT TTAGGGGAGA 
TAGGTGTCCA TTGTCCATTG TCCAAACAAA 
GGTTGTTTTT TATGTTAAAG TGAAAAAAAC 
ATAGAAAAGA AGAGAAAAAA GTTGCTGTTA 
TATTAAATAG AATCAAAGAA AAA7A7AA7A 
TAAAAAAA7A TGCAAAGGAG GAATACGGTG 
GGCATGCTGC CTATCTATCA CTAAA77TTG 
GAGCGAAAAT GTAATAAAAG AAAC7GAAAA 
A777G7777A 7A7CCACAA7 C A G C AAAA G C 
CA77CAA777 G7AG7G7C7C CA77ACA7G A 
AAAAGAGCAT TATCATA77C TAG7GA7G7A 
AA7AA77AAC AGAAGAA77G AA7GCGAC7A 
TTGTGAGA7A TATGCTTCAC A7GGACGA7C 
TGATAGTT7A 7GGCGG7G7A GA7G77GATG 
ATAAATTAA7 TAAAGAAA7G A77GAGTT7A 
GTTTAATGGA TTATGCAA7G AAGT7TAAA7 
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AGCGTAAGCG AAAGTAGTAG CGACAGCTAT 720 
TAATGGACGC AGCGCATCAC ACGCAAAAAG 780 
TTAATTAAAG ACCTTTTTGA GGTCTTTTTT 840 
AAACATAGGG GGGTACTACG ACCTCCCCCC 900 

TAAATAAATA TTGGGTTTTT AATGTTAAAA 960 

ACATG7TCGG AGGTACAGTG ATGGTTGTAG 1020 

C7T7AAGAC7 7ACAACAGAA GAAAA7GAGA 1030 

77AC7AAA7C AGA7GCAACC CGTATTCTAA 1140 

CA7777AAAC AAAAAAAGAT AGACAGCAC7 12 00 

77AAG7G7A7 7AGCACCG7T A7TATATCA7 1260 

CAAGAAAAA7 7CAAGACGAC G7AA77GGAC 1320 

CGACTGG77A GAG7AT77AA AAGAG77ACA 13S0 

7ACGGA7AC7 CA7ACAGAAG G7AGGA7GAA 1440 

TGAGGG7AA7 AAATCT7ATG AACAGATAAA 1500 

77CCGCAGA7 7GCAGGAAGT GTGAAAGG7C 1560 

C7AA7AAA77 7AAATATCAA AAAGAAGA7A 1620 

AA77A7TAAA GAAAACAACA ACAGATAGAT 1680 

T7GA7GAACA AGGAATCGTA GAATTTAAGA 1740 

T7GA7GA7TG GT7CCCGCTT TTATGTGA7A 1800 
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ACTCGGCGTA TGTTATTCAA GAATATATAA AATCAAATCG GTATAAATCT GACCGATAGA 1860 

TTTTGAATTT AGGTGTCACA AGACACTCTT TTTTCGCACC AGCGAAAACT GGTTTAAGCC 1920 

GACTGCGCAA AAGACGAATT CGAGCTCGGT ACCGGTAATG ACTCTCTAGC TTGAGGCATC 1980 

AAATAAAACG AAAGGCTCAG TCGAAAGACT GGGCCTTTCG TTTTATCTGT TGTTTGTCGG 2040 

TGAACGCTCT CCTGAGTAGG ACAAATCCGC CGC7C7AGCT AAGCAGAAGG CCATCCTGAC 2100 

GGATGGCCTT TTTGCGCCGC GGAGATCTAA ATATTCCTbG TGGCGCAAGC TTTGTACCAG 2160 

GGGAACCGTC GACTCAAGAT GGGAATGGGC A7CGCACGCA TGTCGCCGGG ACGATCGCTC 2220 

C77TAAACAA T7CGA77CGC GTTCTTGGCG 7ACCCCCGAC CGCTGAGCTA TACGCTGTTA 2280 

AAGTCCTAGG GGCGAGCGGT TCAGGTTCGG 7CAGC7CG AT TGCCCAAGCA TTGGAATGGG 2340 

CAGGGAACAA TGGCATGCAC GTTGCTAATT TGAGTTTAGG AAGCCCTTCG CCAAGTGCCA 2400 

CAC7CG AGCA AGCTG7TAAT AGCGCGAC7T CTAGACGCGT 7CTTG77GTA GCGGCATC7C 2460 

GG AA77CAGG TGCACGC7CA ATCACCTATC CGGCGCGCTA 7CCCAACCCA A7GCCAC7CG 2520 

GAGC7ACTGA TCAAAACAAC AACCGCCC7A GC7777 CACA CTATGCCGCA GGCC77GACA 2580 

T7GTCGCACC CCGGGTAAAC GTGCAGAGCA CATACCCAGS TTCAACA7AT GCCAGCTTAA 2640 

ACGG7ACATC GA7GGC7AC7 CCTCATGT7G CAGG7GCGGC CGCCC77G77 AAACAAAAGA 2700. 

ACCCATCTTG G7CTAATGTA CAAAT7CGAA A7CA7C7AAA GAA7ACGGCA ACTAG7TTAG 2760 

GAAGCACGAA CT7GTATGGA AGCGGAC7TG T7AACGCAGA AGCGGCAACG CGTTAAAAAT 2820 

GAGGAGGGAA GCTT7ATGAG TAAAGGAGAA GAAC77TTCA CTGGAG77GT CCCAATTC77 2880 

,GT7GAATTAG ATGGCGATGT TAATGGGCAA AAAT7C7C7G TTAG7GGAGA GGGTGAAGGT 2940 
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GATGCAACAT ACGGAAAACT TACCCTTAAA TTTATTTGCA CTACTGGGAA GCTACCTGTT 3000 
CCATGGCCAA CGCTTGTCAC TACTCTCTCT TATGGTGTTC AATGCTTTTC TAGATACCCA 3060 

GATCATATGA AACAGCATGA CTTTTTCAAG AGTGCCATGC CCGAAGGTTA TGTACAGGAA 3120 

AGAACTATAT TTTACAAAGA TGACGGG AAC TACAAGACAC GTGCTGAAGT CAAGTTTGAA 3180 

GGTGATACCC TTGTTAATAG AATCGAGTTA AAAGGTATTG ATTTTAAAGA AGATGGAAAC 3240 

ATTCTTGGAC ACAAAATGGA ATACAATTAT AACTCACATA ATGTATACAT CATGGCACAC 3300 

AAACCAAAGA ATGGCATCAA AGTTAACTTC AAAA7TAGAC ACAACATTAA AGA7GGAACC 33 60 

G7TCAATTAG CACACCAT7A 7CAACAAAA7 AC7CCAA77G GCG A7GGCCC TCTCCTTTTA 3420 

CCAGACAACC A7TACCTC7C CACCCAATCT CCCC777CCA AAG A7CCCAA CGAAAACAGA 3480 

GATCACATGA 7CCT7C7TG A G777GTAACA CC7CCTCGCA TTACACATCG CATCG ATGAA 3540 

CTATACAAAT AAAACAAGTT AAGCCATCCA G77TATG CAT CCCTTAACTT ACTTATTAAA 3600 

TAATTTATAG CTATTGAAAA GAG A7AJ\G AA 77C7TCAAAC CTAATATTGT TTAAATCGTC 3660 

AATTCCTGCA TGTTTTAAGG AA77C77AAA TTG ATTTTTT G7AAA7AT7T TC77CTA77C 3 7 ; 20 

TTTGTTAACC CATTTCATAA CGAAA7AAT7 ATACTTTTGT TTATCTTTCT GTG ATATTCT 3780 

TGATTTTTTT CTACTTAATC TGATAAGTGA GCTATTCACT TTACGTTTAG GATGAAAATA 3840 

TTCTCTTGGA ACCATACTTA AT AT AG AAA T ATCAAC77CT GCCATTAAAA GTAATGCCAA 3 900 

TGAGCGTTTT GTATTTAATA ATCTTTTAGC AAACCCGTAT TCCACGATTA AATAAATCTC 3960 

ATTAGCTATA CTATCAAAAA CAATTTTGCG TATTATATCC GTACTTATGT TATAAGGTAT 4020 

ATTACCATAT ATTTTATAGG ATTGGTTTTT AGGAAATTTA AACTGCAATA TATCCTTGTT 4080 
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TAAAACTTGG AAATTATCGT GATCAACAAG 
GTCTATTTCA ATGGCAGTTA CGAAATTACA 
TTTTCCTGAG CCGATTTCAA AGATATTATC 
TTTATCTATA TTATGTTTTG AAGTAATAAA 
CATTATAACC CTCTTTAATT TGGTTATATG 
ACTTATTTTT TGTTTGGTTG ATAATGAACT 
ATATTTTTTC CTCCTTATAA AATTAGTATA 
ATG ATGAGTG A7CGTTAAAT TTATACTGCA 
AGAGATTTAT CTAATTTC7? TTTTCTTGTA 
TTTGGTCGTA GAGCACACGG TTTAACGACT 
ACACTTTATC AAATCTATAT ACG7TTA7AT 
CATTCAATTT TGAGGGTTGC CAGAG77AAA 
CATAATCAAA GAGAGAATAA AAAC7A7AA7 
AAAAATTATG ATTTGATTAJa CGCACAAAAT 
ATTGATGAGA ATTATTCAGG GAAACGTAAA 
GGACTGGTTA CAAGTGATAA AGATTTCTTT 
TTTTTTAAAG ATAGCTTGGA GTTTCTAGAA 
GCGACTGTCC ATCTGGATGA AAGAGTCCCA 
GAGGACGGGA GATTGTCTGC AAAAGAACAG 
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TTTATTTTCT GTAGTTTTGC ATAATTTATG 4140 

CCTCTTTACT AATTCAAGGG TAAAATGGCC 4200 

ATGTTCATTT AATCTTATAT TTGTCATTAT * 4260 

GTTTTGACTG TGTTTTATAT TTTTCTCGTT 4320 

AATTTTGCTT ATTAACGATT CATTATAACC 4380 

GTGCTGATTA CAAAAATACT AAAAATGCCC 4440 

ATTA'TAG'Cr.C CAGCTCTGAT AAATATGAAC 4500 

ATCGGATGCG ATTATTGAA7 AAAAGATATG 4 5 60 

AAAAA.AGAAA GTTCTTAAAG GTTTTATAGT 4 620 

TAATTACGAA GTAAATAAGT CTAGTGTGT7 4680 

ATATTTATTA TCCGCAGGTG TAGCATGTCT 47 4 0 

GGATCAAGTA ATACAAACGG GATACAAAGA 4800 

AATAAAGACA TAAATCATGA GG AAA CAT AT 4 860 

ATAAAGTATA AAGATAAAAT TGATGAAACG 4920 

ATTCGGTCAG ATGCAATTCG ACATGTGGAC 4930 

GATGATTTAA GCGGAGAAGA AATAGAACGA 5040 

AATGAATACG GTAAGGAAAA TATGCTGTAT 5100 

CATATGCACT TTGGTTTTGT CCCTTTAACA 5160 

TTAGGCAACA AGAAAGACTT TACTCAATTA 5220 
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CAAGATAGAT TTAATGAGTA TGTGAATGAG AAAGGTTATG AACTTGAAAG AGGCACGTCC 5280 
AAAGAGGTTA CAGAACGAGA ACATAAAGCG ATG 5313 
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Claims 

1. A process for in vivo recombination of homologous DNA se- 
quences, comprising the following steps 

5 a) incubating a cell containing at least two DNA struc- 

tures, each comprising a DNA sequence and an origin of 
replication, under conditions which favour integration 
of one of the DNA-structures into one of the others, 
thereby forming a hybrid DNA structure; 
10 b) incubating the cell under conditions which favour cross- 

ing out from said hybrid DNA structure, thereby forming 
novel DNA structures, each 'comprising a recombined DNA 
sequence and an origin of replication; 
c) repeating steps a)-b) at least once. 

15 

2. A process for in vivo recombination of homologous DNA se- 
quences, comprising the steps 

a) incubating a cell containing at least two DNA struc- 
tures, each comprising a DNA sequence and an origin of 

20 replication, under conditions where i) at least one ori- 

gin of replication is non-functional and ii) the cell is 
under a selective pressure which only allows growth of 
the cell if it contains the DNA structure which is hav- 
ing the non-functional origin (i.e. favoring an integra- 

2S tion of one of the DNA-structures into one of the oth- 

ers, thereby forming a hybrid DNA structure); 

b) changing of said conditions to conditions, where fewer 
origins of replication are non-functional than in step 
a) (i.e. favoring a crossing out from said hybrid DNA 

30 structure) ; 

c) repeating steps a)-b) at least once. 

3. The process according to claim 1-2, characterized in that 
at least one DNA structure is a chromosome, preferred a bacte- 

35 rial chromosome, more preferred a Bacillus chromosome, espe- 
cially a Bacillus subtilis chromosome. 
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4; The process according to claim 1-3, characterized in that 
at least one DNA structure is a vector, preferred a bacterial 
vector, more preferred a vector able to replicate in a Bacillus 
cell, especially a Bacillus subtilis cell. 

5 

5. The process according to claim 4, wherein at least two of 
the DNA constructs are two different vectors. 

6. A process for in vivo recombination of homologous DNA se- 
10 quences, comprising the steps 

a) inserting the homologous DNA sequences in at least two 
different vectors, said vectors contain different origins 
of replication; 

b) transferring the vectors into a cell; 

15 c) incubating the cell under conditions where i) at least one 
origin of replication is non-functional and ii) the cell 
is under a selective pressure which only allows growth of 
the cell if it contains the DNA structure which is having 
the non-functional origin (i.e. favoring an integration of 

20 one of the DNA-structures into one of the others, thereby 

forming a hybrid DNA structure) ; 

d) changing of said conditions to conditions, where fewer 
origins of replication are non-functional than in step c) 
(i.e. favoring a crossing our from said hybrid DNA struc- 

5 ture) ; 

e) repeating steps c)-d) as many tines as wanted, however at 
least once. 

7. The process according to claim 1-6, characterised in that 
0 at least one origin of replication is selected in such a way ' 

that the replication rate of the vector can be regulated chemi- 
cally or physically. 

8. .The process according to claim 7, characterised in that at 
5 least one origin of replication is temperature sensitive. 
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9.- The process according to claim 8, characterised in that the 
replication rate of at least one vector can be regulated by a 
temperature shift. 

5 io. The process according to claim 1-9, characterised in that 
the vectors contain genes for markers, preferred selectable 
markers, especially antibiotic markers. 

11. The process according to claim 10, characterised in that the 
io vectors contain genes for different markers. 

12. The process according to any. of the preceding claims, 
wherein at least one DNA structure do not comprise ori(-) (i.e. 
only comprise Ori(+)). 



15 



13. The process according to claim 12, wherein the DNA structure 
which do not comprise Ori(-) is a DNA structure which origin 
(i.e. herein Ori( + )) is functic.-.al during the hybrid DNA struc- 
ture formation performed according a process according to any of 

20 the preceding claims. 

14. The process according to any of the preceding claims, 
wherein it is formation of the hybrid structure is measured in 
vivo, preferably by having a system of DNA structures, wherein 

25 an in vivo screenable protein is expressed only after the hybrid 
structure has been formed. 

15. The process according to claim 14, wherein said screenable 
protein is a protein which is fluorescent under suitable expo- 

30 sure, and in particular said fluorescent protein is Green Flo- 
rescent protein (GFP) or variants thereof. 

16. The process according to any of the preceding claims, 
wherein crossing out from the hybrid DNA structure is a forced 

35 crossing out event. 

17. The process according to claim 16, wherein the forced cross- 
ing out from the hybrid structure is performed by: 



WO 98/23416 PCT/DK97/00567 

41 



i) constructing a system wherein the DNA hybrid struc- 
ture do not express a selectable marker, and at 
least one of the at least two DNA structures accord- 
ing to any of the preceding claims is able to ex- 

5 press said selectable marker; 

ii) incubating the cell under selective conditions, 
thereby forcing a cross out of the DNA hybrid struc- 
ture, since the cell is only able to express the 
marker gene after the crossing out of the DNA hybrid 

10 structure has taken place. 



18. The process according to 17, wherein thp selectable marker 
in step i) is expressed on a DNA structure where the origin of 
replication is non-functional during a forced integration of the 

15 DNA structures into a DNA hybrid structure performed as de- 
scribed above, and preferably the selectable marker gene is an 
antibiotic resistance gene. 

19. The process according to any of the preceding claims, 

20 wherein the homologous DNA sequences comprise DNA sequences from 
at least one pre-made DNA library. 



20. The process according to any of the preceding claims, 
wherein the homologous DNA sequences are comprised of two or 

25 more different set of homologous DNA sequences ("set A" "set B", 
"setC" etc.), and wherein each of said set of homologous se- 
quences ("set A" "set B" , "setC" etc.) are inserted in a differ- 
ent DNA construct ("DNA construct A", "DNA construct B", "DNA 
construct C" etc..) according to any of the preceding claims. 

30 

21. The process according to claim 20, wherein at least one of 
said sets of homologous DNA sequences is derived from a pre-made 
DNA library, and at least one of said sets of homologous DNA se- 
quences is a DNA sequence which are homologous to said DNA li- 

35 brary (i.e. resulting in a process for in vivo recombination of 
said DNA library to a DNA sequence which are homologous to said 
DNA library) . 
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22. The process according to 20, wherein said sets of homologous 
DNA sequences is derived from the same pre-made DNA library 
(i.e. resulting in a process for in vivo recombination of said 
DNA library) . 

23. The process according to claim 20, wherein said sets of ho- 
mologous DNA sequences are derived from at least two different 
pre-made DNA libraries (i.e. resulting in a process for in vivo 
recombination of at least two different DNA libraries) . 

24. The process according to claim 20, wherein said sets of ho- 
mologous DNA sequences each only -contain one DNA sequence (i.e. 
recombination of two or more homologous sequences) . 

25. The process according to any of the preceding claims , 
wherein two sets of homologous DNA sequences ("set A' 1 and M set 
B") are recombined in vivo, comprising the steps 

a) inserting sequence (s) from "set A M in a vector contain- 
ing a temperature resistant origin of replication and a 

0 gene for a first marker; 

b) inserting sequence(s) from "set B u in a vector contain- 
ing a temperature sensitive origin of replication and a 
gene for a second r.arker; 

c) transferring the vectors into a cell; 

5 d) incubating the cell at a temperature where both origins 

of replication are functional, optionally under a selec- 
tion pressure that favours cells with both marker genes; 

e) shifting the temperature to a temperature, where only 
the temperature resistant origin of replication is func- 

o tional; 

f) repeating steps d)-e) at least once. 

26. The process according to any of the preceding claims, 
wherein two sets of homologous DNA sequences ("set A" and "set 
15 B") are recombined in vivo , comprising the steps 

a) inserting sequence(s) from "set A" in a vector containing a 
temperature resistant origin of replication and a gene for a 
first marker; 
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b) " inserting sequence (s) from "set B" in a vactor contain- 

ing a temperature sensitive origin of replication and a 
gene for a second marker; 

c) transferring the vectors into a cell; 

d) incubating the cell at a temperature where only the tem- 
perature resistant origin of replication is functional, 
optionally under a selection pressure that favours cells 
with both marker genes; 

e) optionally, shifting the temperature to a temperature, 
where both origins of replication are functional; 

f) repeating steps d)-e) at least once. 



27. The process according to claim 1-26, characterised in that 
the cell containing the DNA structures is defective or transito- 

15 rily inactivated in the enzymatic mismatch repair system or has 
a reduced level of mismatch repair. 

28. The process according to claim 1-27, characterised in that 
the cell is a Gran positive cell, such as a Staphylococcus cell, 

20 a Streptococcus ceil, a Bacillus cell, or 4 n particular a Bacil- 
lus subtilis cell. 

29. The process according to claim 1-28, characterised in that 
the cell is a Gram negative cell, preferred an Escherichia cell, 

25 especially an E. coli cell. 

30. The process according to claim 1-29, characterised in that 
the homologous DNA sequences are encoding polypeptides, espe- 
cially enzymes, and in particular enzymes such as an amylase, 

30 lipase, cutinase, cellulase, oxidase, phytase, and a protease. 

31. A method for producing one or more recombinant protein (s) 
having a desired biological activity comprising: 

a) * performing a process for in vivo recombination of 

35 homologous DNA sequences according to any of the preceding 

claims, wherein at least one of the DNA structure (s) is an 
expression vector; 
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b) - expressing the numerous different recombinant polypeptides 

encoded by the numerous different recombined DNA sequences 

from step a) ; and 

c) screen or select the numerous different recombinant 
proteins from step b) in a suitable screening or selection 
system for one or more recombinant polypeptides (s) having 
a desired activity. 

32. A set of at least two vectors, where at least two vectors 
are comprising at least two individually different origins of 
replication able to function in the same cell under permissive 
conditions, and two homologous DNA' sequences encoding polypep- 
tides, said DNA sequences having more than 50% homology, prefer- 
able more than 70%. 

33. A vector system, comprising at least 

a) a vector A containing a DNA sequence encoding a polypep- 
tide and an origin of replication 

b ) a vector B containing a DNA sequence homologous to the 
? DNA sequence in vector A, and an origin of replication 

that differs from the origin of replication in vector A. 

34. The vector system according to claim 33, characterised in 
that the DNA sequences encodes polypeptides with a biological 

5 activity, especially enzymes. 



35, 



A cell containing a vector system according to claim 33-34 



36 The cell according to claim 35, characterised in that is a 
>o Gran positive cell, preferred a Bacillus cell, especially a Ba- 
cillus subtilis cell. 

37. The cell according to claim 35, characterised in that the 
cell. is a Gram negative cell, preferred an Escherichia cell, es- 

35 pecially an E. coli cell. 

38. A recombined DNA sequence, characterised in that it is a 
result of a process according to one or more of the claims 1-30, 
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or it is encoding a recombinant protein having a desired bio- 
logical activity, wherein said recombinant protein having a de- 
sired biological is produced by the method according to claim 
31. 

39. A process for producing a polypeptide, especially an en- 
zyme, comprising i) growing a cell expressing a polypeptide en- 
coded by a recombined DNA sequence according to claim 38, and 
ii) purifying said expressed polypeptide. 

40. A polypeptide, especially an enzyme, characterised in that 
it is produced by a process according to claim 39. 

41. A polypeptide, especially an enzyme, characterised in that 
it is encoded by the recombined DNA sequence according to clain 
38 . 
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